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Abstract 

Reaction systems are a formal model that has been introduced to in- 
vestigate the interactive behaviors of biochemical reactions. Based on the 
formal framework of reaction systems, we propose new computing mod- 
els called reaction automata that feature (string) language acceptors with 
multiset manipulation as a computing mechanism, and show that reaction 
automata are computationally Turing universal. Further, some subclasses 
of reaction automata with space complexity are investigated and their lan- 
guage classes are compared to the ones in the Chomsky hierarchy. 



1 Introduction 

In recent years, a series of seminal papers [71 [HI 12] has been published in which 
Ehrenfeucht and Rozenberg have introduced a formal model, called reaction 
systems, for investigating interactions between biochemical reactions, where two 
basic components (reactants and inhibitors) are employed as regulation mecha- 
nisms for controlling biochemical functionalities. It has been shown that reaction 
systems provide a formal framework best suited for investigating in an abstract 
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level the way of emergence and evolution of biochemical functioning such as 
events and modules. In the same framework, they also introduced the notion 
of time into reaction systems and investigated notions such as reaction times, 
creation times of compounds and so forth. Rather recent two papers [THl [H] 
continue the investigation of reaction systems, with the focuses on combina- 
torial properties of functions defined by random reaction systems and on the 
dependency relation between the power of defining functions and the amount of 
available resource. 

In the theory of reaction systems, a (biochemical) reaction is formulated as a 
triple a — (R a , la. Pa), where R a is the set of molecules called reactants, I a is the 
set of molecules called inhibitors, and P a is the set of molecules called products. 
Let T be a set of molecules, and the result of applying a reaction a to T, denoted 
by res a (T), is given by P a if a is enabled by T (i.e., if T completely includes 
R a and excludes I a ). Otherwise, the result is empty. Thus, res a (T) = P a 
if a is enabled on T, and res a (T) — otherwise. The result of applying a 
reaction a is extended to the set of reactions A, denoted by resA(T), and an 
interactive process consisting of a sequence of res a (I 1 ) 's is properly introduced 
and investigated. 

In the last few decades, the notion of a multiset has frequently appeared 
and been investigated in many different areas such as mathematics, computer 
science, linguistics, and so forth. (See, e.g., [2] for the reference papers written 
from the viewpoint of mathematics and computer science.) The notion of a 
multiset has received more and more attention, particularly in the areas of 
biochemical computing and molecular computing (e.g., [T9l [23]). 

Motivated by these two notions of a reaction system and a multiset, in 
this paper we will introduce computing devices called reaction automata and 
show that they are computationally universal by proving that any recursively 
enumerable language is accepted by a reaction automaton. There are two points 
to be remarked: On one hand, the notion of reaction automata may be taken 
as a kind of an extension of reaction systems in the sense that our reaction 
automata deal with multisets rather than (usual) sets as reaction systems do, in 
the sequence of computational process. On the other hand, however, reaction 
automata are introduced as computing devices that accept the sets of string 
objects (i.e., languages over an alphabet). This unique feature, i.e., a string 
accepting device based on multiset computing in the biochemical reaction model 
can be realized by introducing a simple idea of feeding an input to the device 
from the environment and by employing a special encoding technique. 

In order to illustrate an intuitive idea of the notion of reaction automata 
and their behavior, we give in Figure Q] a simple example of the behavior of a 
reaction automata A that consists of the set of objects {po,Pi, a, b, a', /} (with 
the input alphabet {a, b}), the set of reactions {a = (po, {a, b, a'}, /), a! = 
(Poa, {b},Poa'),a 2 = (p a% 0,pi), a 3 = (pia'6, {o},Pi), a 4 = (pi, {a, b, a'}, /)}, 
where {pa} is the initial multiset and {/} is the final multiset. Note that in a 
reaction a = (R a ,I a ,P a ), multisets R a and P a are represented by string forms, 
while I a is given as a set. In the graphic drawing of Figure [T] each reaction a^ 
is applied to a multiset (of a test tube) after receiving an input symbol (if any 
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Figure 1: A graphic illustration of interactive biochemical reaction processes for 
accepting strings in the language L — {a n b n | n > 0} in terms of our reaction 
automaton A. 

is provided) from the environment. In particular, applying ao to {po} leads to 
that the empty string is accepted by A. It is seen, for example, that reactions 
ai and a2 are enabled by the multiset T = {po,a',a'} only when inputs a 
and b, respectively, are received, which result in producing R\ = {pq, a', a', a'} 
and i?2 = {pi,a'}, respectively. Thus, we have that res ai (TU {a}) — R\ and 
res a2 (TU{b}) = i?2 ■ Once applying a 2 has brought about a change of po into p\ , 
A has no possibility of accepting further inputs a's, because of the inhibitors in 
a 3 or a 4 . One may easily see that A accepts the language L = {a n b n | n > 0}. 
We remark that reaction automata allow a multiset of reactions a to apply 
to a multiset of objects T in an exhaustive manner (what we call maximally 
parallel manner), and therefore the interactive process sequence of computation 
is nondeterministic in that the reaction result from T may produce more than 
one product. The details for these are formally described in the sequel. 

This paper is organized as follows. After preparing the basic notions and 
notations from formal language theory in Section 2, we formally introduce the 
main notion of reaction automata together with one language example in Section 
3. Then, Section 4 describes a multistack machine (in fact, two-stack machine) 
whose specific property will be demonstrated to be very useful in the proof of 
the main result in the next section. Thus, in Section 5 we present our main 
results: reaction automata are computationally universal. We also consider 
some subclasses of reaction automata from a viewpoint of the complexity theory 
in Section 6, and investigate the language classes accepted by those subclasses 
in comparison to the Chomsky hierarchy. Finally, concluding remarks as well 
as future research topics are briefly discussed in Section 7. 

2 Preliminaries 

We assume that the reader is familiar with the basic notions of formal language 
theory. For unexplained details, refer to [13]. 

Let V be a finite alphabet. For a set U C V, the cardinality of U is denoted 
by \U\. The set of all finite- length strings over V is denoted by V* . The empty 
string is denoted by A. For a string x in V* , \x\ denotes the length of x, while 
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for a symbol a in V we denote by \x\ a the number of occurences of a in X. For 
k > 0, let prefkix) be the prefix of a string x of length k. For a string u> = 
ai<22 • • • a n £ V*, w R is the reversal of w, that is, {a-ya^ ■ ■ ■ a n ) R = a n ■ ■ • a%a\. 
Further, for a string x = a\02 ■ ■ ■ a n £ V* , x denotes the hat version of x, i.e., 
x — • ■ ■ a n , where each &j is in an alphabet V = {a \ a £ V} such that 

vr\V = %. 

We use the basic notations and definitions regarding multisets that follow [H 
[T5] . A multiset over an alphabet V is a mapping : V — > N, where N is the 
set of non-negative integers and for each a £ V, fj,(a) represents the number of 
occurrences of a in the multiset fi. The set of all multisets over V is denoted by 
V*, including the empty multiset denoted by where /J,\(a) = for all a £ V. 
A multiset fi may be represented as a vector, n(V) = (/i(ai), . . . , /i(a„)), for an 
ordered set V = {ai, . . . ,a n }- We can also represent the multiset (i by any 
permutation of the string — ■ ■ -a^ a "^. Conversely, with any string 

x £ V* one can associate the multiset \i x : V — > N defined by fi x (a) = \%\a 
for each a £ V . In this sense, we often identify a multiset /i with its string 
representation or any permutation of . Note that the string representation 
of n\ is A, i.e., w^ x — A. 

A usual set U C V is regarded as a multiset flu such that [i v (a) = 1 if a is 
in f7 and fJ,u( a >) = otherwise. In particular, for each symbol a £ V, a multiset 
/Ltx a \ is often denoted by a itself. 

For two multisets fi±, fi2 over V, we define one relation and three operations 
as follows: 

Inclusion : /Ltj C /x 2 iff /ii(a) < ^2 ( a )> f° r each a G V, 
iSum : (/xj + M2)( a ) = A*i( a ) + / i 2( a ), for each a € V, 

Intersection : (/ij PI /L«2)(a) = min{/ii(a), /12(a)}, for each a € V, 

Difference : (/ii — /i2)(a) = Mi (a) — ^2(0-), for each a€7 (for the case ^2 C /i! only). 

A multiset ^i is called multisubset of /12 if Mi 5= A*2- The sum for a family of 
multisets = {Mi}iei is also denoted by Yliel -^ or a mu hiset [i and n G N, 
fJL n is defined by ^ n (a) = n ■ /i( a ) f° r each a £ V. The weight of a multiset /U is 
ImI = E ae yM(a)- 

We introduce an injective function stm : V* — > that maps a string to a 
multiset in the following manner: 



3 Reaction Automata 

As is previously mentioned, a novel formal model called reaction systems has 
been introduced in order to investigate the property of interactions between 
biochemical reactions, where two basic components (reactants and inhibitors) 
are employed as regulation mechanisms for controlling biochemical functional- 
ities ([3 HI [9]). Reaction systems provide a formal framework best suited for 




(for n > 1) 
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investigating the way of emergence and evolution of biochemical functioning on 
an abstract level. 

By recalling from [7J basic notions related to reactions systems, we first 
extend them (defined on the sets) to the notions on the multisets. Then, we 
shall introduce our notion of reaction automata which plays a central role in 
this paper. 

Definition 1. For a set S, a reaction in S is a 3-tuple a = (i? a , I a , P a ) of finite 
multisets, such that i? a , P a G I a C S and R a !~l I a = 0. 

The multisets R a and P a are called the reactant of a and the product of a, 
respectively, while the set I a is called the inhibitor of a. These notations are 
extended to a multiset of reactions as follows: For a set of reactions A and a 
multiset a over A, 

R a = Y. Ia = (J 7 - P « = E P a (a) - 

In what follows, we usually identify the set of reactions A with the set of 
labels Lab(A) of reactions in A, and often use the symbol A as a finite alphabet. 

Definition 2. Let A be a set of reactions in S and a G A# be a multiset of 
reactions over A. Then, for a finite multiset T G S*, we say that 
(f ) a is enabled by T if R a C T and 7 a n T = 0, 

(2) a is enabled by T in maximally parallel manner if there is no (3 G such 
that aCft and a and /3 are enabled by T. 

(3) By En p A (T) we denote the set of all multisets of reactions a G A# which are 
enabled by T in maximally parallel manner. 

(4) The results of A on T, denoted by Rcsa{T), is defined as follows: 

Res A {T) = {T-R a +P a \ae En p A (T)}. 

Note that we have Res A (T) = {T} if En p A (T) = 0. Thus, if no multiset of 
reactions a G is enabled by T in maximally parallel manner, then T remains 
unchanged. 

Remarks 1. (i) It should be also noted that the definition of the results of A 
on T (given in (4) above) is in contrast to the original one in [7], because we 
adopt the assumption of permanency of elements: any element that is not a 
reactant for any active reaction does remain in the result after the reaction. 
(ii) In general, En A (T) may contain more than one element, and therefore, so 
may Rcsa(T). 

(Hi) For simplicity, I a is often represented as a string rather than a set. 

Example 1. Let S = {a, b, c, d, e] and consider the following set A = {a, b, c} 
of reactions in S: 

a = (6 2 , a, c), b = (c 2 , 0, b), c = (be, d, e). 
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(i) Consider a finite multiset T = b A cd. Then, ct\ — a is enabled by T, while 
neither b nor c is enabled by T, because Rb % T and I c n T ^ 0. Further, 
a 2 = a 2 is not only enabled by T but also enabled by T in maximally parallel 
manner, because no j3 with 012 C /3 is enabled by T. Since i? a 2 = 6 4 , P a 2 = c 2 , 
and En p A (T) = {a 2 }, we have 



(m) Consider T' = b 3 c 2 e. Then, /?i = ab and /?2 = ac are enabled by X", while 
be is not. Further, it is seen that both /3\ and /?2 are enabled by T' in maximally 
parallel manner, and En p A (T') = {ab, ac}. Thus, we have 



If we take T" = bed, then none of the reactions from A is enabled by T". 
Therefore, we have Res A {T") = T" . 

We are now in a position to introduce the notion of reaction automata. 

Definition 3. (Reaction Automata) A reaction automaton (RA) A is a 5-tuple 
A = {S, S, A, D , /), where 

• S is a finite set, called the background set of A, 

• S(C S) is called the input alphabet of A, 

• A is a finite set of reactions in S, 

• £>o £ S# is an initial multiset, 

• f G S is a special symbol which indicates the final state. 

Definition 4. Let A = (S, S, A, Do, /) be an RA and w = a\ • ■ • a n G S*. An 

interactive process in A with input w is an infinite sequence 7r = Do ,...,£)$,..., 
where 



By I-P(»4, ui) we denote the set of all interactive processes in A with input w. 

In order to represent an interactive process it, we also use the "arrow nota- 
tion" for 7r : (01, Do) —>••••—> (a„, D„_i) — > (D„) — > (D„ + i) —>••••, or alterna- 
tively, D Di D 2 ^ a "-! D n _i D„ -> D„ +1 

For an interactive process 7r in A with input w, if En A (D m ) = for some 
m > |io|, then we have that ResA(D m ) = {D m } and D m = D m+i = ■ • ■ . In 
this case, considering the smallest m, we say that n converges on D m (at the 
m-th step). When an interactive process ir converges on D m , each Di of tt is 
omitted for i > m + 1. 



Res A (T) = {T- R a 2 + P a2 } = {c 3 d}. 



Res A (T') = {b 2 ce,c 2 e 2 }. 
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Figure 2: (a) Reaction diagram: Interactive processes for accepting a 2 , a 4 and 
a 8 in A. Some arrows are associated with a multiset of reactions applied at the 
step. 

Definition 5. Let A = (S, E, A, Do, /) be an RA. The language accepted by A, 
denoted by L(A), is defined as follows: 

L(A) — {w e E* | there exists ir 6 IP(A, w) that converges on 

D m at the m-th step for some m > \w\, and / C D m }. 

Example 2. Let us consider a reaction automaton A = (S, E, A, Dq, /) defined 
as follows: 

S = {a, b, c, d, e, /} with E = {a}, 
A = {ai,a2,a 3 ,a4,a 5 ,a 6 }, where 

ai = (a 2 , 0, 6), a 2 = (6 2 , ac, c), a 3 = (c 2 , 6, 6), 

a4 = (bd, ac, e), as = (cd, b, e), ag = (e, abc, /), 
A) = d. 

Let w = aaaaaaaa 6 S** be the input string and consider an interactive process 
7T such that 

7r : d y a ad — ^ a bd -> a abd b 2 d ab 2 d ^ a b 3 d -> a ab 3 d ^ a b 4 d ->c 2 d^bd^e^ f. 

It can be easily seen that ir S IP{A,w) and w G L(A). Figure [2] illustrates 
the whole view of possible interactive processes in A with inputs a 2 , a 4 and a 8 . 
For instance, since a 2 , S En p A (b 4 d), it holds that c 2 d € ResA(b 4 d). Hence, the 
step & 4 <i — ► c 2 d is valid. We can also see that L(A) = {a 2 \ n > 1} which is 
context-sensitive . 

4 Multistack Machines 

A multistack machine is a deterministic pushdown automaton with several 
stacks (j!3j). It is known that a two-stack machine is equivalent to a Turing 
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machine as a language accepting device. 

A fc-stack machine M = (Q, E, T, S,po, Z , F) is dchncd as follows: Q is a set 
of states, E is an input alphabet, T is a stack alphabet, Zq = (Zoi, Z02, ■ • • , ^ofe) 
is the fc-tuple of the initial stack symbols, po G Q is the initial state, F is a set of 
final states, 6 is a transition function defined in the form: S(p, a, X\, X%, . ■ ■ , Xk) = 
(9,71,72, . . . ,7fc), where p, q G Q, a e E U {A}, Xj £ T, 7, G T* for each 
1 < i < fc. This rule means that in state p, with Xi on the top of i-th stack, 
if the machine reads a from the input, then go to state q, and replace the the 
top of each i-th. stack with 7, for 1 < i < k. We assume that each rule has a 
unique label and all labels of rules in 6 is denoted by Lab(S). Note that the 
fc-stack machine can make a A-move, but there cannot be a choice of a A-move 
or a non- A-move due to the deterministic property of the machine. The /c-stack 
machine accepts a string by entering a final state. 

In this paper, we consider a modification on a multistack (in fact, two-stack) 
machine. Recall that in the simulation of a given Turing machine TM with an 
input w = a\a,2 • ■ • at in terms of a multistack machine M, one can assume the 
following (see [13]): 

(i) At first, two-stack machine M is devoted to making the copy of w on 
stack-2. This is illustrated in (a) and (b)-l of Figure [3J for the case of 
k = 2. M requires only non-X-moves. 

(ii) Once the whole input w is read-in by M, no more access to the input tape 
of M is necessary. After having w R on stack-2, M moves over w R (from 
stack-2) to produce w on stack-1, as shown in (b)-2. These moves only 
require A-moves and after this, each computation step of M with respect 
to w is performed by a A-move, without any access to w on the input tape. 

(Hi) Each stack has its own stack alphabet, each one being different from the 
others, and a set of final states is a singleton. Once M enters the final 
state, it immediately halts. Further, during a computation, each stack is 
not emptied. 

Hence, without changing the computation power, we may restrict all computa- 
tions of a multistack machine that satisfies the conditions (i), (ii), (Hi). We call 
this modified multistack machine a restricted multistack machine. 

In summary, a restricted A;-stack machine M r is composed by 2fc + 5 elements 
as follows: 

M r — (Q, E, r 1; T 2 , . . . , T kl 5,p Q , Z i,Z 2, ■ ■ ■ , z ok ,f), 

where for each 1 < i < k, Z^i £ Ti is the initial symbol for the i-th stack used 
only for the bottom, / G Q is a final state, and its computation proceeds only 
in the above mentioned way (i), (ii), (Hi). Especially, A-moves are used after 
all non- A- moves in a computation of M r . 

Proposition 1. (Theorem 8.13 in |13j ) Every recursively enumerable language 
is accepted by a restricted two-stack machine. 
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Figure 3: (a) Turing machine (TM); (b)Two-stack machine M simulating TM, 
where $ is the end marker for the input. 



5 Main Results 

In this section we shall show the equivalence of the accepting powers between 
reaction machines and Turing machines. Taking Proposition 1 into considera- 
tion, it should be enough for the purpose of this paper to prove the following 
theorem. 

Theorem 1. If a language L is accepted by a restricted two-stack machine, then 
L is accepted by a reaction automaton. 

[Construction of an RA] 

Let M — (Q, S, Ti, T2, S,po, Xo, Yo, /) be a restricted two-stack machine with 
Ti = {X ,X u ...,X n }, T 2 = {Y ,Y u ...,Y m }, n,m > 1, where T = T 1 UT 2 , 
Xq and Yq are the initial stack symbols for stack-l and stack-2, repsectively, 
and we may assume that Ti n T 2 = 0. 

We construct an RA Am = (5, S, A, _D , /') as follows: 

S' = guQUEurufu Lab(S) u {/'}, 

A = A U A a U A a U A x U A x U A x U A x U A Y U A Y U A f U A f , 
D =p X Y , 
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where the set of reactions A consists of the following 5 categories : 



(1)^0 : 


= {(p aX Q Y 0l Lab(5),q ■ stm(x) ■ stm(y) ■ r') \ r : 8{p , a, X a ,Y ) = 


(q,x,y), r' G Lab(S)}, 

\ J. 7 7 tS / / \ / J 7 


(2)A a 


= {(paXiYjr, r, q ■ stm(x) ■ stm(y) ■ r ) | a € E, r : S(p, a, X;, 


Y j) = 


(q,x,y), r' G Lab(S)}, 


A a 


= {{paXiYjr, r, q ■ stm(x) ■ stm(y) ■ r ) | a G E, r : S(p, a, JQ, 


Yj) = 


(a.x.v). v G Lab(S)\, 


(3)A X 


= {(pXiYjr, E U T, q ■ stm(x) ■ stm(jj) ■ r')\r : S(j>, A, X^Yj) 


= (q, 


x,y), r' G Lab{5)}, 


A x 


= {(pXiYjr, E U r, q ■ stm(x) ■ stm(y) ■ r ) | r : 5{p, A, Xj, Yj) 




x, y), r' G Lab(S)}, 


(4)A X 


= {(Xl gufu (Lab(S) - {r}) U {/'}, Xt l ) < k < n, r 


:5(p, 


a,Xi,Yj) = (q,x,y)}, 


A x 


= {(XlQuru (Lab(S) - {r}) U {/'}, xf ' ) < k < n, r 


:8(p, 


a,Xi,Yj) = (q,x,y)}, 


A Y 


= {(Y k 2 7 Q U f U (Lab(S) - {r}) U {/'}, ) \ < k < m, r 


:S(p, 


a,X h Yj) = (q,x,y)}, 


A y 


= {(Y k 2 7 Q uru (Lab(S) - {r}) U {/'}, Y^ )\0<k<m,r 


:S(p, 


a,X h Yj) = (q,x,y)}, 




= {(/ ) r,/ / )} ) 






Af 


= {(/,r,/')}. 







Proof. We shall give an informal description on how to simulate M with an 
input w = a\ai ■ ■ ■ Oug in terms of Am constructed above. 

M starts its computation from the state po with Xq and Yq on the top of 
stack-1 and stack-2, respectively. This initial step is performed in Am by ap- 
plying a reaction in Aq to D$ — pqXqYq together with a\. In order to read the 
whole input w into Am, applying reactions in (2) and (4) leads to an interactive 
process in Am ■ -Do — > ai D\ — > a ' 2 D 2 — > a3 ■ ■ ■ — > ai D £ , where Di just corre- 
sponds to the configuration of M depicted in (b)-l of Figure[3] After this point, 
only reactions from (3), (4) and (5) are available in Am, because M makes only 
A-moves. 

Suppose that for k > 1, after making fc-steps M is in the state p and has 
ctfc G T| and G T 2 on the stack-1 and the stack-2, respectively. Then, from 
the manner of constructing A, it is seen that in the corresponding interactive 
process in Am, we have : 

{Dk = P • stm{otk) ■ stm((3k) • r (if k is even) 
Dk — P • stm(&k) ■ stm((3k) ■ r (if k is odd) 

for some r G Lab(S), where the rule labeled by r may be used at the (k + l)-th 
step. (Recall that stm(x) is a multiset, in a special 2-power form, representing 
a string x.) Thus, the multisubset u stm(ak)stm((3k)" in D k is denoted by the 
strings in either T* or T* in an alternate fashion, depending upon the value k. 
Since there is no essential difference between strings denoted by T* and its hat 
version, we only argue about the case when k is even. 

Suppose that M is in the state p and has a — Xn ■ ■ ■ XuXq on the stack-1 
and /3 = Yji ■ ■ ■ Yj S Yg on the stack-2, where the leftmost element is the top sym- 
bol of the stack. Further, let r be the label of a transition 5(p, a,k+i,Xn, Yj%) — 
(q,x,y) (if 1 < k < I - 1) or 5(p, A, Xn,Yji) = (q,x,y) (if / < fc) in M to 
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be applied. Then, the two stacks are updated as a' = xX{ 2 - ■ ■ Xi t X$ and 
/?' = yYj 2 ■ ■ ■ Yj s Yo. In order to simulate this move of M, we need to prove that 
it is possible in Am, D k -> a *+i D k+1 (if 1 < k < I - 1) or D k -> D k+1 (if I < k), 
where 

D k =p - stm(X il X l2 ■ ■ ■ X lt X ) ■ stm{Y j iY :j 2 ■ ■ ■ Y js Y )r 
D k+ i = q ■ stm{xX i2 ■ ■ ■ X it X ) ■ stm(yY j2 ■ ■ ■ Y JS Y )r' 

for some r' e Lab(5). Taking a close look at D k , we have that 

D k = P x tl Y Jir ■ xf 2 x£ • • • xt'xf ■ y^y% ■ ■ ■ r/r V 2S , 

from which it is easily seen that a multiset of reactions z = rxj2 • • • x? t * 2 Xp* 1 yj2 • 
is in En p AM (a k+1 + D k ) (if 1 < k < I - 1) or in En p AM (D k ) (if I < k), i.e., it is 
enabled by a k+ \ + D k (if 1 < k < I — 1) or D k (if I < k) in maximally parallel 
manner, where 

{r = (pa k+ iXnYj\r, T, q ■ stm(x)stm(y)r') e A a (if 1 < k < I — 1) 
r = (pXnYjir, S U f, q ■ stm{x)stm{y)r') G A A (if I < k), 

for some r' e Lab{5) 1 

x, = (Xf, gufu La6(5) - {r} U {/'}, l^' ) £ A x (for i = 0, i2, . . . , it), 

Yj = (F/,Qufu La6(J) - {r} U {/'}, Y^ ) e Ay (for j = 0, j2, . . . ,js). 
The result of the multiset of the reactions z is 

, , , , , " n\x\ - n t-2+\x\ * t — l+\x\ * n\x\ * OS — 2+1x1 ^os-l+|cc| 

q ■ stm(x)stm(yy ■ X% • • • X 2 X 2 ■ Y* ■ ■ ■ Y* Yq 2 

= g • stm(xX i2 ■ ■ ■ X it X a ) ■ stm(yY ]2 ■ ■ ■ Y js Y )r' 
= D k+ i 

Thus, in fact it holds that D k ^ a *+i D k+l (if 1 < k < I - 1) or D k ->■ D fe+ i (if 
Z < fc) in Am- 

We note that there is a possibility that undesired reaction r' can be enabled 
at the (k + l)th step, where r' is of the form 

f r' = {pa k+1 X iu Y jv r, f , • «tm(i')«*m(l/ / )r / ) € A a (if 1 < k < I - 1) 
\ r' = (p^Y^r, S U f , g' • stm(x')stm(y')r') 6 A A (if I < k), 

with u ^ 1 or w / 1, that is, the reactant of r' contains a stack symbol which 
is not the top of stack. If a multiset of reactions z' = r'x'j • • • x^y^ • • • y^, with 
x' x , . . . ,x' t , E Ax, y'i, • • • ,y' s , £ Ay is used at the (k + l)th step, then D k+ i 
contains both the symbols without hat (in T) and the symbols with hat (in Q 
and r). This is because in this case, Xn or Yji in D k which is not consumed at 
the (k + l)-th step remains in D k+ i (since the total numbers of Xn and Yji are 
odd, these objects cannot be consumed out by the reactions from (4)). Hence, 
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no reaction is enabled at the (k + 2)-th step and /' is never derived after this 
wrong step. 

From the arguments above, it holds that for an input w E E*, M enters 
the final state / (and halts) if and only if there exists it : Do, . . . , Di, . . . E 
IP(Am,w) such that D^-i contains / or /, D k contains /', and it converges 
on Dk, for some fc > 1. Therefore, we have that L{M) = L{Am) holds. 

□ 

Corollary 1. Every recursively enumerable language is accepted by a reaction 
automaton. 

Recall the way of constructing reactions A of Am in the proof of Theorem 
1. The reactions in categories (1), (2), (3) would not satisfy the condition of 
determinacy which is given immediately below. However, we can easily modify 
Am to meet the condition. 

Definition 6. Let Am = (S, E, A, D , /') be an RA. Then, Am is deterministic 
if for a = (R, I, P),a' = {R' , V, P') € A, {R = R') A (I = I') implies that a = a'. 

Theorem 2. If a language L is accepted by a restricted two-stack machine, then 
L is accepted by a deterministic reaction automaton. 

Proof. Let M — (Q, E, Ti, r 2 , S,p , X , Y , /) be a restricted two-stack machine. 
For the RA Am = (S, E, A, Do, /') constructed for the proof of Theorem 1, we 
consider A' M = (S U Lab(S), E, A', D , /'), where A' consists of the following 5 
categories : 

(1) A) = {(poaX Q Y , Lab(S) U {r'}, q ■ stm(x) ■ stm(y) -r')\r : S(p , a, X , Y ) = (q, x, y), r' G Lab(S)}, 

(2) A a = {(paXiYjr,t U {f'},q ■ stm(x) ■ stm(y) ■ r')\a E E, r : 5{p,a,X t ,Y j ) = (q,x,y), r' E Lab(S)}, 
A a = {(paXiYjr, T U {/}, q ■ stm(x) ■ stm(y) ■ r') \ a € E, r : S(p,a, X l ,Y j ) = (q, x, y), r' E Lab(S)}, 

(S)A X = {{pX^r, E U f U {?}, q ■ stm(x) ■ stm(y) ■ r>) \ r : 6(p, A, X u Yj) = (q, x, y), r' E Lab(S)}, 
A x = {(pXiY jr ,i:uTU{r'},q- stm(x) ■ stm(y) ■ r')\r : 5(p,\,X l ,Y j ) = (q,x,y), r' E Lab(S)}, 

(4) A X = {(Xl, gufu (Lab(S) - {f}) U {/'}, Xf ' ) < k < n, r : S(p, a, X t , Yj) = (q, x, y)}, 
A x = {{Xl guru (Lab(6) - {r}) U {/'}, ) < k < n, r : S(p, a, X t , Yj) = (q, x, y)}, 
A Y = {(Y k 2 ,Qufll(Lab(S) - {f}) U {/'}, Y^ ) \ < k < m, r : S(p, a, X i} Yj) - (q,x,y)}, 
A Y = {(Y k 2 ,Qliru(Lab(6) - {r}) U {/'}, Y? M ) \ < k < m, r : 5{p,a,Xi,Yj) - (q,x,y)}, 

(5) A f = {(f,t,f% 

i/ = {(/,r,/')}. 

The reactions in categories (1), (2), (3) in A' meet the condition where 
A' m is deterministic, since the inhibitor of each reaction includes r' or r'. We 
can easily observe that the equation L(M) = L(A' m) is proved in the manner 
similar to the proof of Theorem 1. 

□ 
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Corollary 2. Every recursively enumerable language is accepted by a determin- 
istic reaction automaton. 

6 Space Complexity Classes of RAs 

We now consider space complexity issues of reaction automata. That is, we 
introduce some subclasses of reaction automata and investigate the relationships 
between classes of languages accepted by those subclasses of automata and 
language classes in the Chomsky hierarchy. 

Let A be an RA and / be a function defined on N. Motivated by the notion 
of a workspace for a phrase-structure grammar ([21]), we define: for w € L(A) 
with n = \w\, and for tt in IP {A, w) that converges on D m for some m > n and 
D m includes the final state, 

WS{w, 7r) = max{|_D,;| | Di appears in ti }. 

i 

Further, the workspace of A for w is defined as: 

WS(w, A) = min{WS'(u;, tt) | tt G IP(A, w)}. 

7T 

Definition 7. (i). An RA A is f(n)-bounded if for any w G L(A) with n = \w\, 
WS(w,A) is bounded by f{n). 

(ii). If a function f(n) is a constant k (resp. linear, polynomial, exponential), 
then A is termed fc-bounded (resp. linearly-bounded, polynomially-bounded, 
exponentially-bounded), and denoted by fc-RA (resp. lin-RA, poly-RA, exp- 
RA). Further, the class of languages accepted by £;-RA (resp. ^n-RA, poly-RA, 
exp-RA, arbitrary RA) is denoted by k-KA (resp. CKA, VTIA, £ TLA, KA). 

Let us denote by H£Q (resp. CXN ', CF, CS, 1Z£) the class of regular (resp. 
linear context-free, context-free, context-sensitive, recursively enumerable) lan- 
guages. 

Example 3. Let L ± = {a n b n c n \n>0} and consider an RA A = (5, £, A, D Q , /) 
defined as follows: 

S = {a, b, c, d, a', b', c', /} with £ = {a, b, c}, 
A = {ai, a 2 , a 3 , a 4 }, where 

ai = (a, 66', a'), a 2 = (a'6, cc', 6'), a 3 = (6'c, 0,c'), a 4 = (d,abca'b' , /), 
A, = d. 

Then, it holds that L x = L(Ax) (see FigureS]). 

Example 4. Let L 2 = {a m b m c n d n \ m,n > 0} and consider an RA A2 — 
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Figure 4: Reaction diagram of A\ which accepts L\ = {a n b n c n \ n > 0}. 
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Figure 5: Reaction diagram of ^2 which accepts L 2 = {a m b m c n d n \ m,n> 0}. 



(S, E, A, Do, /) defined as follows: 

5" = {a, b,c,d,a',c',p ,p 1 ,p 2 ,P3,f} with E = {a, b, c,d}, 

A = {ai,a 2 ,a3,a4,a5,a 6 ,a7,a 8 ,a 9 ,aio,aii}, where 

»i = (ap ,bc,a'p ), a 2 = (a'bp ,c,pi), a 3 = (a'bpi, c,pi), a 4 = (cp ,d,c'p 2 ), 
»5 = (cpi,d, c'p 2 ), a 6 = (cp 2 ,d,c'p 2 ),& 7 = (c'dp2 ) $,P3), a 8 = (cdp 3 ,(b,p 3 ), 
a 9 = (p ,abcd,f), a w = (pi,abcda', /), an = (p 3 ,abcda'c' ', /), 

-D = po- 

Then, it holds that L 2 = £(-4 2 ) (see Figure EJ. 

It should be noted that Ai and .A2 are both izn-RAs, therefore, the class 
of languages C1ZA includes a context-sensitive language L\ and a non-linear 
context-free language L 2 . 

Lemma 1. For an alphabet E with |E| > 2, let h : E* — > E* 6e an injection 
such that for any w £ E* 7 is bounded by a polynomial of \w\. Then, 

there is no polynomially-bounded reaction automaton which accepts the language 
L = {wh(w)\w e E*}. 

Proof. Assume that there is a poly-KA A — (S, E, A, Do, /) such that L(A) = 
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{wh(w) | w G E*}. Let \S\ = mi, E| = m.2 > 2 and the input string be wh(w) 
with | to | = n. 

Since ^ s bounded by a polynomial of \w\, \wh(w) \ is also bounded by a 

polynomial of n. Hence, for each Di in an interactive process 7r G IP (A, wh(w)), 
it holds that |A < p(n) for some polynomial p(n) from the definition of a pofo/- 



(miHfc denotes the number of repeated combinations of mi things taken fc at a time.) 

Therefore, there is a polynomial p'(n) such that |£> p („)| = p'(n). Since it holds 
that |E™| = (m2) n , if n is sufficiently large, we obtain the inequality |2? p ( n )| < 



For i > and w G E*, let = {A G V p{tl) \tt = D , . . . , D u . . . G 

IP(A,w)} C T> p f n \, i.e., /i(ui) is the set of multisets in which appear 

as the z'-th elements of interactive processes in IP(A,w). From the fact that 
= {wh(w) \w G E*} and h is an injection, we can show that for any 
two distinct strings w\ 1 W2 G E™, I n {w\) and I n {w2) are incomparable. This is 
because if I n (wi) C I n {w2), the string W2h(w\) is accepted by .4, which means 
that h{w\) = h[u>2) and contradicts that ft is an injection. 

Since for any two distinct strings w\,W2 G E", I n {w\) and I n (w2) are in- 
comparable and I n (wi), I n (u)2) C T> p r n \, it holds that 



However, from the pigeonhole principle, the inequality |{/„(w) u; G E™} < E™ 
contradicts that for any two distinct strings w\,W2 G E™, I n {w\) ^ I n {w2)- d 

Theorem 3. T7ie following inclusions hold : 

(1) . 7e££ = C £7e^ C 7^.4 G ClZA = n£ (for each fc > 1). 

(2) . OlA C CSC SUA. 

(3) . CXM (CJ-) and CR.A are incomparable. 

Proof. (1). From the definitions, the inclusion 1Z£Q C 1-TZA is straightforward. 
Conversely, for a given fc-RA A = (S, E, A, Do, /) and for w G £(*4), there exists 
a 7r in IP(A, w) such that for each Di appearing in n, we have \Di\ < fc. Let 
Q = {D G S* | |D| < fc} and F = {D \ D G Q,f C D,Res A (D) = {£>}}, 
and construct an NFA M = (Q, E, <5, Z? : -F), where <5 is defined by <5(D, a) 9 £>' 
if L> D' for a G E U {A}. Then, it is seen that L(A) = L{M), and fc- 
TZA C TZ£Q, thus we obtain that 7*L£ (/ = k-lZA. The other inclusions are all 
obvious from the definitions. The language L = {a n b n n > 0} proves the 
proper inclusion : 1Z£ Q C £7\L4. A proper inclusion VTZA G £1ZA is due to 
that L 3 = {ww R \ w <E {a, b}*} G SUA - VTZA, which follows from LemmaQ] 
(2). Given an lin-KA A, one can consider a linearly bounded automaton (LB A) 



RA. 



Let V p{n) = {D G S* \D\ < p(n)}. Then, it holds that 




p(n) 



p(n) 



{I n (w) \w€X n }\ < \V p{n) \ < |E"|. 
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Figure 6: Language class relations in the Chomsky hierarchy : L\ = {a n b n c n \ 
n > 0}; L 2 = {a m b m c n d n \m,n> 0}; L 3 = {ww R | w £ {a, &}*} 

M that simulates an interactive process tt in IP(A,w) for each w, because of 
the linear boundedness of A. This implies that C1ZA C CS. A proper inclusion 
is due to that L3 = {ww R \ w £ {a,b}*} £ CXM — LIZA, which follows from 
Lemma [1] 

Further, for a given LBA M, one can find an equivalent two-stack machine 
M s whose stack lengths are linearly bounded by the input length. This implies, 
from the proof of Theorem [TJ that M s is simulated by an RA A that is expo- 
nentially bounded. Thus, it holds that CS C £1ZA. 

(3). The language L 1 = {a n b n c n | n > 0} (resp. L 2 = {a m b m c n d n \m,n> 0}) 
is in CRA — CT (resp. C1ZA — CXAf), while, again from LemmaQ] the language 
L3 is in LXM — CTZA. This completes the proof. □ 



7 Concluding Remarks 

Based on the formal framework presented in a series of papers [51 [SI EH HI] > 
we have introduced the notion of reaction automata and investigated the lan- 
guage accepting powers of the automata. Roughly, a reaction automaton may 
be characterized in terms of three key words as follows : a language accepting 
device based on the multiset rewriting in the maximally parallel manner. Specif- 
ically, we have shown that in a computing schema with one-pot solution and a 
finite number of molecular species, reaction automata can perform the Turing 
universal computation. The idea behind their computing principle is to simu- 
late the behavior of two pushdown stacks in terms of multiset rewriting with 
the help of an encoding technique, where both the manner of maximally parallel 
rewriting and the role of the inhibitors in each reaction are effectively utilized. 

There already exist quite a few number of literature investigating on the 
notion of a multiset and its related topics ([2]) in which multiset automata 
and grammars are formulated and explored largely from the formal language 
theoretic point of view. Rather recent papers ( jTHJ [17]) focus on the accepting 
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power of multiset pushdown automata to characterize the classes of multiset 
languages through investigating their closure properties. 

To the authors' knowledge, however, relatively few works have devoted 
to computing languages with multiset rewriting/communicating mechanism. 
Among them, one can find some papers published in the area of membrane 
computing (or spiking neural P-systems) where a string is encoded in some 
manner as a natural number and a language is specified as a set of natural num- 
bers (e.g., [3])- Further, recent developments concerning P-automata and its 
variant called dP-automata are noteworthy in the sense that they may give rise 
to a new type of computing devices that could be a bridge between P-system 
theory and the theory of reaction systems and automata ([51 Q3J 

In fact, a certain number of computing devices similar to reaction automata 
have already been investigated in the literature. Among others, parallel labelled 
rewrite transition systems are proposed and investigated Ql2j) in which multi- 
set automata may be regarded as special type of reaction automata, whereas 
neither regulation by inhibitors nor maximally parallel manner of applying rules 
is employed in their rewriting process. A quite recent article [T] investigates the 
power of maximally parallel multiset rewriting systems (MPMRSs) and proves 
the existence of a universal MPMRS having smaller number of rules, which 
directly implies the existence of a universal antiport P-systems, with one mem- 
brane, having smaller number of rules. In contrast to reaction automata, a 
universal MPMRS computes any partially recursive function provided that the 
input is the encoding of a register machine computing a target function. 

Turning to the formal grammars, one can find random context grammars 
([6]) and their variants (such as semi-conditional grammars in |18j ) that em- 
ploy regulated rewriting mechanisms called permitting symbols and forbidding 
symbols. The roles of these two are corresponding to reactants and inhibitors 
in reactions, whereas they deal with sets of strings (i.e., languages in the usual 
sense) rather than multisets. We finally refer to an article on stochastic com- 
puting models based on chemical kinetics, which proves that well-mixed finite 
stochastic chemical reaction networks with a fixed number of species can achieve 
Turing universal computability with an arbitrarily low error probability f [22]). 
In this paper, we have shown that non-stochastic chemical reaction systems with 
finite number of molecular species can also achieve Turing universality with the 
help of inhibition mechanism. 

Many subjects remain to be investigated along the research direction sug- 
gested by reaction automata in this paper. First, it is of importance to com- 
pletely characterize the computing powers and the closure properties of com- 
plexity subclasses of reaction automata introduced in this paper. Secondly, from 
the viewpoint of designing chemical reactions, it is useful to explore a method- 
ology for "chemical reaction programming" in terms of reaction automata. It is 
also interesting to simulate a variety of chemical reactions in the real world by 
the use of the framework of reaction automata. 
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